An Idea of a Rough Set Theory Based Document Classification System
Similar resources
Web Document Classification Based on Rough Set
In the traditional Vector Space Model representation of Web documents, the zero-valued similarity problem between vectors occurs frequently, which degrades classification quality when defining the relation between Web documents. In this paper, a novel Web document representation and classification approach based on rough sets is proposed. First, the TF*IDF weighting scheme is used to assign weight...
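The zero-valued similarity problem mentioned in this abstract can be illustrated with a minimal sketch. It assumes a plain TF*IDF weight of the form tf * log(N / df) and cosine similarity; the abstract is truncated, so the paper's exact weighting and its rough set remedy are not reproduced here. The function names and the toy documents are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: tf * log(N / df), with no smoothing.
    """
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: tf[t] * math.log(n / df[t]) for t in tf})
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy corpus: the first two documents are about the same
# topic but share no terms, so their vectors are orthogonal.
docs = [
    ["car", "engine", "repair"],
    ["automobile", "motor", "maintenance"],
    ["car", "motor", "race"],
]
vecs = tfidf_vectors(docs)
sim_disjoint = cosine(vecs[0], vecs[1])  # no shared terms
sim_overlap = cosine(vecs[0], vecs[2])   # shares the term "car"
```

Here `sim_disjoint` is exactly zero even though the two documents are topically related, while `sim_overlap` is positive only because one literal term is shared.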
Optimizing The Classification System Based On Rough Set Theory
Every day, a vast amount of data and information is being stored. Most of this information is dynamic and transactional, and the data is updated continually. Extracting knowledge from such large, dynamic data is becoming a difficult task with respect to time and efficiency. Fortunately we are served with the intelligent proc...
Rough set theory based prognostic classification models for hospice referral
BACKGROUND: This paper explores and evaluates the application of classical and dominance-based rough set theory (RST) for the development of data-driven prognostic classification models for hospice referral. In this work, rough set based models are compared with other data-driven methods with respect to two factors related to clinical credibility: accuracy and accessibility. Accessibility refers...
Anonymizing classification data using rough set theory
Identity disclosure is one of the most serious privacy concerns in many data mining applications. A well-known privacy model for protecting against identity disclosure is k-anonymity. The main goal of anonymizing classification data is to protect individual privacy while maintaining the utility of the data in building classification models. In this paper, we present an approach based on rough sets for m...
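The k-anonymity model named in this abstract requires that every combination of quasi-identifier values appear in at least k records. The following is a minimal sketch of that check only; it does not implement the paper's rough set approach, which the truncated abstract does not describe. The function name, field names, and records are hypothetical.

```python
from collections import Counter

def is_k_anonymous(records, quasi_identifiers, k):
    """Return True if every quasi-identifier combination occurs at least k times."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())

# Hypothetical generalized records; "diagnosis" plays the role of the class label.
records = [
    {"age": "30-39", "zip": "130**", "diagnosis": "flu"},
    {"age": "30-39", "zip": "130**", "diagnosis": "asthma"},
    {"age": "40-49", "zip": "148**", "diagnosis": "flu"},
    {"age": "40-49", "zip": "148**", "diagnosis": "diabetes"},
]
ok_2 = is_k_anonymous(records, ["age", "zip"], 2)  # each group has 2 records
ok_3 = is_k_anonymous(records, ["age", "zip"], 3)  # no group has 3 records
```

With these records, the table satisfies 2-anonymity over `age` and `zip` but not 3-anonymity.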
A Study on Rough Set Theory Based Dynamic Reduct for Classification System Optimization
Nowadays a huge amount of data is generated every minute and transferred frequently. Although the data is sometimes static, most commonly it is dynamic and transactional. Newly generated data is constantly added to the existing data. To discover knowledge from this incremental data, one approach is to run the algorithm repeatedly for the modified data s...
Journal
Journal title: Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
Year: 2020
ISSN: 1347-7986,1881-7203
DOI: 10.3156/jsoft.32.4_778